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2 142 1. Provision Of Clinical Data 142 

?. Defer ml rial I or, i 3. Correlation Of A 

With Drug Response 143 a SuiicMJo Procedure (To 4 SKP Cm:!; 143 b. 

Pare-Down Procedure (To 10 SNP Limit).,...,..,..., 145 c. Cos! Reduction 146 J. 

REFERENCES .' 147 II. ABSTRACT OF THE INVENTION 212 CLAIMS 

We claim. 1 . A method of generating a haplotype database for a population, comprising data elements representative of 
i I , , , ' ' ii ( ^ l< i' ' - t c ; r n v 1 „ x livid lai 

- x > ! t - gene filing polymorphism and haplotype data cic™" " .- ^ ^ ^ i i>' 1 h" 1 "C /ifj? P> only no p mi-* 
and hapiotypes for the locus; and 1) (b) storing the polymorphism and haplotype data elements c dividuais in a 
computer-readable database, wherein the data elements are organized according to the spatial relationships between the 
polymorphisms and hapiotypes and a reference nucleoside sequence for the locus 

2. The method of claim 1 , wherein the locus ss a gene or a gene feature and the haplotype data elements represent 
haploty; i rs for thee 

3. The method of claim 2, wherein ihe deriving step comprises ascertaining the frequency of the hapiotypes arid haplotype 
pairs ac -ok | \ 

4 > met hoc ciai n 2, fur he x rip it, t /: (a) cx xj a nucleotids 
sequence of the gene or the gene feature from a first chromosome and a second chromosome in each Individual in the 
population to generate a plurality of nucleotide sequences for the population; (c) aligning the plurality of nucleotide 
sequences for the population -kit. i °. 10m the aligned st x noes, a - e) selecting two hapiotypes for 
- ; non d <ai . t , , c ' c 

5. The method of claim 4, wht eln ihe meshed turilx . j 3 t i 

6. The iret^cd o* < lai i g comp! ses corret. t L i distribution of hapiotypes or haplotype 
pairs for effects imposed by a limited number oi individuals in the population. 

r l l also comprises analyzing compliance of the observed distribution with 

fvlendelian Inheritance principles. 

8. The method of claim 1 wl mij t i j jul.it jri, a clinical 

peulit >n .id >i je>piioi,li (i - e 'n, i x . <. - - [i en, <r c tenia on 

5 A ne id oi pred ; -he prs oi ; pe p c 3: (a) identifying 3 genotype lor th< 
individual: , enumerating aii possible haplotype pairs which are consistent , tlx: genotype; i accessing a r it/ 
containing i"t i ' k "L \ f i - 
Hi I f >( lo, i. i ill \. i t i. 

for the individual, 

10. The method of claim 9, wherem the identifying step comprises deteirr n»ig i ■ - ' » , " rt 

1 1 . The method of claim 1 0, wherein the determining includes calculating phyiogenetic and/or linkage information for ihe 
reference haplotype pairs. 

12. The method of claim 1 0, wherein the enumerating step comprises listing the possible haplotype pairs in order of their 
frequency In the database 

13. A method for identifying a correlation between a haplotype pair and a clinical response to a treatment, or other 
>henotyp it I i ece s i x, a dak j< nia tu t t [-i h 
phenotypes, exhibited by a clinical population; (b) select ng a candioa -> ft r ! i i ed with the 

i ~ ~ ^a.. t - • - „r = da a or 

each member of the clinical population, ihe haplotype data comprising information on a plurality of polymorphic sites 



present in ;he candidate locus: (d) storing the haplotype data: and (s) calculating the degree of correlation between 
haplotype u C i her phe ^otype. by statistically analyzing the haplotype and 

clinical response data. 

14. The method of claim 13 wherein step (e) is performed last. 

15. The method of claim 13 wherein step (a) Is performed before any one of steps (b), (c) or (d). 

16. The method of claim 13 wherein step (a) is performed after steps (b), (c) and (d). 

1 7. The method of any one of claims 13-18. wherein the treatment comprises administration of a drug or drug candidate. 

18. The method of claim 17, wherein the candidate locus is a gene or a gene feature. 

19. The method of claim 1 3, further comprising displaying or ouiputting the correlation. 

20. The method of claim 19, further comprising calculating the statistical significance of the correlation. 

21 . The method of claim 20, wherein the providing haplotype data step comprises (a) providing a genotype for the 
indrvidu ji; (b) enumeral } all possible haplotype pairs which are consistent with the genotype: (c) determining a 
probability for each possible haplotype pair ihal the individual has thai possible haplotype pair, by accessing a database 
containing frequency data for haplotype pairs in a reference population; and (d) analyzing the determined probabilities to 
infer the individual's haplotype pair. 

22. A method for identifying a correlation between a haplotype pair and susceptibility to a condition or disease of interest, 
or other phenotype of interest, comprising the steps of : (a) selecting a candidate locus hypothesized to be associated with 
the phenotyf. ^ ^ it least two poly n ...... <. i u r oiovidmg 

' joi %|.e d it, lui h ... v . ^ u ' N vC : d! c n y diet a c < 

interest (''disease haplotype data"); ic) organizing the disease haptotype data In a database: (d) statistically analyzing the 
n r rt [ , - \nplr- y0 e data for the 

candidate locus for each member of , 

the reference haploi/o [ i hi 

frequency in the population having the phenotype, condition or disease of Interest than in the healthy reference population, 

identifying a correlation of th« h . ^ep* uility to the disease or condition of interest. 

23. The method of claim 22 wherein step (f) is performed after step (d). 

24. The method of claim 22 wherein step (e) Is performed before any one o? steps (b), (c), or id). 

^ ) >r (d). 

26. The method of any one of claims 22-25, wherein (he candidate locus Is a gene or a gene feature. 

28. The method of claim 27, further comprising calculating the statistical significance of the Identified correlation. 

29 ^-e met- r'»i i * < ""i if providing haplotype data step comprises: (a) providing a genotype for the 
individual (hi <=run Q : nt e geno y pe L m each possible 

haplotype pair, determining i a . 1 1 fhat the individual has thai n i L / pair, by . ; a database containing 

frequency data for haplotype pairs in a reference population; and (d) inferring the individual's haplotype pair based on the 
i 5d p ^fcac i ties 



30 -> lethod prec dual's response to a medical or pharmaceutical treatment, comprising: SaS selecting at 
least one candidate gene for which a correlation between hapiofype content and response to the treatment has been 
went! ltd f. 1 (3 it 11 \ h if c h nil t I t d i oadkting 1 h at the 

the correlation. 

31 ie method of claim 3C era - ig step c< uprises outputling a list oi candidate genes associaled wilh 
different respc ises to iht eatment. 

32. The method oi claim 31, further comprising storing the haplotype pair. 

33. The method of claim 32, furiher including generating an error estimate. 

34. A computer implemented method for generating a gene structure screen for display on a display device, comprising 
he step >' ^ , . , v< if o> , \m of occurrence 
of 3 gc x s hap i ber groupings ol a reference population; (b) retrieving from a database 

a ndicative of the frequencies of occurrence of particular nucleotides for the member 
groupings; (c) retrieving from a database data indicative of gene £ net ; (d ^splaying in a third area a graphical 
representation of gene structure c i identifies polymorphic Sites en the gene. ;et selecting one of the t i t >h k > to 
cause the appropriate nucleotide frequencies to be displayed in the second area. 

35. 1 n it t i >l i i v t 
comprising the steps of : (a) displaying in a first 3roa a piurahty c ^eloctnnlr t em^ eai,h 1 < v i\>ohi> 
site for a predetermined gene; (b) selecting one or more of said selectable Items; (c) displaying in a second area the 
haplotype pair oc jrnng i i n j jlatk n or the c i I i die splaying in a third area data 
indicative of haplotype frequencies for a plurality of member groupings within the population. 

35. A 1 1 it implemented method for generating a linkage sereer; >i display on a display device, compiling 1 sleps 
o; : (a) displaying In a re area a graphical scale showing a reference for defer mining progressive degrees of linkage 
between polymorphic sites in a population; (b) displaying in a second area a graphica r * struct having a plurality of 
gods, where each axis of the structure represents polymorphic Sites; on a gene; and where each grid graphically display;; 

-i indication oi deg rdaace with i< rs i n 

shown in the first area, 

37. The method o! claim 36, wherein color is used as the indication of degree of linkage, 

W A corn ju i c 4 1 li I j , 

ems each corresponding to a polymorphic site for a 
predetermined gene; (b) selecting one or more of said selectable Items; (c) displaying in a second area a phylogeneiic tree 
struc ure b >. j t <. i 11 

of nucleotides thai would have ;o be flipped to change one haplotype into another. 

39. The method of claim 38. wherein the nodes are connected by links that indicate a single nucleotide difference between 
nodes. 

tJ Ji j! , a ndicatic of ethnogeographic frequency of occurrence of 

41 . A con Implemented method for generating a genotype analysis screen for display or; a display device, 
comprlsmg the steps o? : [3) displaying a first plurality of selectable items each corresponding to a polymorphic site, and a 

bie items each corresponding to a poly mc 
reference for deteim nin qet^ cade _^ let „a a w ^lcLili pi g (cj disomy! •■g a 

graphical matrix structure having a plurality oi grids, where each axis represents a haplotype indicated by the first 

' , ■• - y displays an indication of degree of identification reliab 
the haplotype correspo d ig • at grid sing specified by the second selectable items, in accordance with the 



42. I he method of claim 41 , wherein the indication of degree is color. 



43. A method ot displaying clinics! response values of a subject population se s function of haplotype pairs of the 
individuals in the popr a) ing from a compi £ c presenting 

A f. . r 4" > .i .'a' the subject populatm ^ ploiype pair matrix 

each tit ti c HI nl } / hi A < t >n £ f t x ^ dna x v ' -, ie ' solotype 

pair corresponding to that coll of the haplotype pair matrix, 

44. A method ' d ' , ' clinical ' p< - " valuer; of a >u > , x >, t ; as . function 'i m > c v pairs of the 
individuals in r r t o comprising- I displaying one t more firs; selectable items representing polymorphic sites 

ill i m \ a „ f i J o _/ Stf n-o sde^iabif- ikm 

representing a clinical response measurement; which, when sete 1 njuro « t t v -1. ems will 

cause display oi a haplotype pair matrix, each of whose cells contain:; a graphical representation of the clinical response 
values for the selected clinical measurement of individuals having the haplotype pair corresponding to that cell of the 
haplotype pair matrix. 

45. The method of claim 43 or 44, wherein the graphical representation of clinical response values is a color scale or gray 
scale, the shade of each cell being proportional to the mean clinical response value of individuals having the haplotype 
pa corresj ondi u N m 

4S. The method ot claim 45, turther comprising displaying a means tor adjusting the range of mean ciimcal response 
values represented by the color scale < gray scale, wherein adjustment of fi range causes the displayed shade of color 

■} <\ ' " ' , , , 1 - V , 

i " a x " ~ , m v r Hi ir s 

individuals across the range of clinical response values. 

48. The method o; any one of cislms 43,44, or 45 wherein af feast one cell includes a selectable ares which, when 
selected, will cause the display of a histogram indicating the distribution of individuals across the range of clinical response 
values. 

49. The method of any one of claims 43,44 or 45 which further comprises displ w ch vrhen 
ft i 3 ! di dual polymorphic 

sites and the clinical response values. 

50. The method o? claim 43,44 or 45 which fur h< ap t , , I < plays 

- xm and standard deviation of clinical response values among individuals having each haplotype pair in 

51. The method of claim 43,44 oi 45 i:h i, In n m I <- 3i 

he display o c calcuisl - ana a clinical 

response values between inc »idi a 3 "a g u fee t hajt - J 1 i 

52. A computer-implemented method for carrying out a genetic algorithm for finding an optima! set of weights to fit a 
function of po! i o _ Ji I ^ _ t r-c \ x I i 
>t ti i n } the 1 v ^ , , i t !< H f 

of agents paramo ^ ^ - r x J' - oiior for setting the ' - - , , ^ e , ; ij ng a variable 

controller for setting the crossove ate par; c splaying one or more se k 1 copending to a 

polymorphic site of a predetermined gene; and (f) displaying a selectable stem for initiation of the genetic algorithm 
calculation; wherein selection of one or more selec spondinc tc a polyene oh c a te and selection ot fhe 

t i c i 1 v. 5 :> r exec ion o he? ; on t 

r v ■> II ^ ^ I -nor of the model as a function of the number of 

he results ol tho g me >p me o,e jhts 

breach of the polymorphic sites. 



1 "V 3t v „ 

comprising 2)'. apiay ; first plurality of selectable item_ r. < c *. bles 31(0' 

displaying a econd plurality rt selectable i ponding to 1 1 I 14 > displaying a 

scatter plot of daia points corresponding ;o the individuals in the selected pop ^ ' f > i em from 

the first plurality of selectable items causes each daia point to be plotted on the x axis of the scatter plot according io the 
vaiue of ° eerrecporiCmg oilmen! outcome value for the individual associated with the data pomp and therein selection 
of a second m i i second piuiafiry of selectable items causes each data ^ to t;e plotted on the y axis of tin 
cotter rbt t r c ; r - e - n - v e - - i , v ~ - t d 

point. 

54. A method iciing a clinical f comprising: (a) 

1 o m or expected to be involved in a particular disease or drug response: (b) 

1 r f n. f. ... jitnd, (o) sequencing 

DMA from each member or the reference population; (;;'■ determining the haplotypes tor each of the selected genes (or 
oihe be oif\"m.n - - - - . x 

statistical nea l , \ ii t . jlimts for each of t r >. 1 ^ n \i\ in j 

individuals who have the medical condition 01 Interest. ( 1 treating individuals lii ;he trial popuiarion according to rhe 
treatment jn v - ,r 3 jn t f <- t 1 n 1 1 m < it | 1 in k f i 

genes (or other loci) for each member of the trial population: (i) determining the correlations between individual responses 
io tfu I ! nrnt 1 >io 3 j" t - n f 

t iin t 1 1 Mi Hi. Si a model h predicts the response of an Individual to the treatment, giver; the Individual's 
hapiotype content. 

55. The met ribuiion found foi tin ret n 
population a reduced sei of genotyping markers, which allow an individual's haplotypes to be a< ik ed without 
conducting a compL-ir n r ne analysis, and using ihe reduced set of genotype markers io determine 
haplotypes in step (h). 

56. 1 iif ' 10 i ni ; 1 . , v. \. l 
comprising ;;;■ providing a daSabase or m-she haplotypes of tlie seiecred gene from a representative cohort of individuals: 
(b x tabulai in fht f c } 1 \ n 1 
result from ail possible pairs of observed haplotypes; (d) calculating the expected frequency of these genotypes assuming 
the Hardy-Weinberg equi b! uir 1. <. h 1 ^ie length m as the 
haplotypes. wherein each mack blocks the Identity of rhe nucleotides at m-n polymorph!;; sites and adrnha rhe identity of 
nucleotides at ihe ofh ^i ' m - ' " 1 g >ong *ith ! t c 
polymorphic sites whose identity is admitted by the mask: (gj from among those masks having an acceptable level of 

L ' 1 1 <. r asjting onty the 

polymorphic sites that are admitred by ihe seiecred mack: and (P assigning to each subject having a particular n-site 
;apiotyp - - Polype of a member of the Initial cohort having the same n-site hapiotype. 

57. The method of claim 5b, wherein the calculation of ambiguity for a mask comprises (at ideriti'ving all pairs of 

ha are fc de jd identical by application of the mask; (b) calculating ihe geometric mean of ihe calculated 
Hardy- Weinberg frequencies of each pair of genotypes identified in step (a); (c) summing ail such geometric means for ail 
ambiguous pairs to obtain an ambiguity score for the mask. 

i-l ' t n t ' .. . ^ ■> j n application of the sel° <. 1 mbigi ty in that two 

3 t < iy We nberg equilibrium predicis 
probabilities PA and PB, where PA+PB=1, ihe assignment of a hapiotype pair is carried out by a process comprising (a) 
selecting a random number between 0 and 1 ; (b) if the random number is less than or equal to PA, assigning the 
pe pair A and (c) if the number is greater than PA, assigning the hapiotype pair B. 

59. A method of detei min poi jrphic sites or sub-haplotypes thai correlate with a clinical response or outcome of 

«. " % i 1 1 * > " - I -I ~s"$e ot outcome data (clinical outcome values) 

\ c Jch ndiv lial SIMP in the hapiotype for ihe degree to which it 
correlates with the clinical outcome values, and generaiing a numerical measure of ihe degree of correlation; (c) saving for 
further processing those individual SNPs whose numerical measure of the degree of correlation wiih ihe clinical outcome 
valuer; exceeds a first cut-off value; (d) generating all possible pair-wise combinations of the saved SNPs so as io provide 



a set of n-site sub-hapioiypes where n - 2: (e) statistically analyzing each newly generated n-site sub- hapiotype for the 
degree to w^ic t ! , v ' ' . . i t i t i i . e degree of 

correlation; [':) saving lor turlher processing those- n-site sub-haplotypes whose numerical measure of I he degree ot 
correlation with ins clinical outcome values exceeds the first i value; i Generating all possible pair-wise 
combinations among and between the saved SNPs and saved sub-haplotypes, to produce new subhapioiypes with 
increased values of n; fh; repeating steps [e) through ig) until either ' no new sub- haplotypes can be generated, or (ii) 
no further suit hc3f I >type s havin v. > he i 



60. The method c clai i 59, further co ing t t pfa rig those saved SNPs and sub-haplotypes whose 

numerical measure o; f degree t correlation will"; I deuce! outcome vaiue exceeds a second cut-oft value, wherein 
the second cut-off vaiue Is area tor than the first cut-oft value. 

51 . The method of claim 59, wherein the nunier:csi measure of degree of correlation ;s replaced by trie p-value for the 
correlation, and SNPs and sub- haplotypes are saved if the p-value Is less than a first cut-off value. 



iiivl xlr (In v -i - - \hs and sub-haplotypes whose p- 

value for the correlation with the clinical outcome value Is less than a second cut-off value, wherein the second cut-off 
value is less than the first selected value. 



63. The method of any one of claims 59-62, further comprising the step of excluding from further processing complex 
subhapioiypes which are constructed from smaller sub-haplotypes, where the smaller sub-haplotypes each have 
> I' ues 1 i a c . n f i 



54. A method of determining polymorphic sites or sub- hepiotypee that ooe-eiate fi a clinical response or outcome of 
interest, comprising: (a) providing - . - t "> ^ f or one oi more gee md clinical response or 

outcome da-a from a cohort of e e -1, zing each single gene hapiotype for the degree to which ii 

correlates with the clinical response or outcome of Interest, and calculating a n in s i! meas ire of the degree of 
correlation: u saving for furiher processing those haplotypes whose numerical measure of the degree of correlation wit 
fie c I 'icaf if" x n c ^ * ^ , - ^ 

polymorphic sites, generating ail possible sub-haplotypes having a single slf 
haplotypes having cn-nt sites, where n ~ i ; iei statistically analysing each newly generated sub-hapioiype for the degree 

ot correlation; (I) saving for further processing those sub-hapioiypes vviio.se numerical measure o; the degree of correlation 
with ihe clinical response or outcome of interest exceeds the first selected vaiue; (g) from the saved sub-haplotypes, 
oeneratir-a att possible sub- haplotvoes havlne one additional t mashed; 1 reoeafino steps ;et through igi until either 
(i) no new sub- haplotyr. >f correlation which exceeds h * f I < < ml i ut 

haplotypes having more unmasked cite;, Ihae a pie-selected limit can be generated. 



65. The method of ■ slain : e star. 1 ; I 

measure oft" am e r n second selected value, 

wherein the second selected value is greater than the first selected value. 



66. The method of claim 64, wherein the numerical measure of degree of correlation Is replaced by the p-value for the 
correlation, and sub-hapioiypes are saved if the p-value is less than a fi3st selected value. 



67. The method of claim 66, further comprising the step of displaying those saved sub-haplotypes whose p-vaiue for the 
correlation with the clinical response or outcome of interest is less than a second selected value, wherein the second 
selected vaiue is less i fie 'it c c e value. 



68. The met i 4-67, further compi ! m further f i 

subhapioiypes which a e rcn r ucted -r mails ib ^nj di ft i if -> v pe- has 

correlation values thai are at leasi as significant as thai of the complex sub- hapiotype. 



69. A computer- usable medium having computer-readable program code stored thereon, for causing a computer to adjust 
observed hapiotype pair frequencies within a population group, said hapiotype pair frequencies being stored in a 



computer-readable da e information for a gene or gene feature of interest, the computer-readable 

program code comprising: (a) computer-readable program code for causing a computer to access said database and 
ili 11 iii ' i c >de lor 

causing a computer to calculate the expected frequency or I generated hapiotypes and haplotype pairs according to the 
Ha d \" n^t ^ }n\ ~ o~ ipc te t ik in or ^ "dot j " 

(c) computer-readable program code for causing a computer to select the most probable haplotype pair for the individual 
based on the observed. 

70. The computef-usabse medium of claim 69, further comprising computer- readable program code stored thereon for 
causing a computer to correct the stored distribution ■:>' hapiotypes or- haplolype pairs for effects imposed by the presence 
or a Si nit ^ j r r i n t it i I ; j 

71 . The compute; -usable medium ei alarm 69, further comprising computer - readable program code stored thereon for 
causing a computer to validate haplotype pair assignments by analyzing for compliance of the assigned haplotype pair 
with 'vlendeL i i c principles. 

72. The computer-usable medium of claim 69, wherein the population is selected from the group consisting of a reference 
population ^ . stion, an ethnic population, a family population and a same-sex 
population. 

73. A computer-usable medium having computer-readable program code stored thereon, for causing haplotype pair 

jnr i to be made to an individual member o; a population whose genotype information 'or a gene or gene feature 
or interest is stored In a computer- readable form, the computer- readable program code comprising; (a) computet reada ? 
program code for causing a computer to generate all possible haplotype pairs consistent with ;k , , - jtype; (b) 
computer-readable pro> ■> > u >mg a computer to access a database containing reference haplotype pair 
freguenoy data and to determine trorn t.tre frequency data the probability, tor each of the possible haplotype pairs, that the 
individual has the possible haplotype pair; and (c) computer-readable program code for causing a computer to select the 
most probable haplotype pair for ihe individual. 

.a . (.btypr i ir 

present at a candidate locus hypothesised to be associated with the clinical response other pheuotype. the o nput i 
readable program code comprising s compute - >m; < iatabas 

containing data on clinical responses to treatments, or other phenotypes t i ■ > a ci nical population; 

(b) computer-readable piog . ^ , 1 , m.afo oidi 

x v c < r. 1 1 c ' u an c •■ 1 eli ; c >iic > present at 

ihe candidate locus: and (c) computer-readable program code tor causing a computer ;o calculate the degree of 

i >. r- - ho plio-o -> i analysis of 

ihe haplotype and clinical response data. 



76. The computer-usable medium of claim /4, wherein the candidate locus is a gene or a gene feature. 

77. The computer-usable medium of claim 74, further comprising computer- readable program code stored thereon for 
causing a computer to store, display, or output ihe degree of correlation. 

78. The computer-usable medium of claim 74, further comprising computer- readable program code stored thereon for 
causing i computer to alculal statistical sic 

79. A computer-usable medium having computer-readable program code stored thereon, for causing a computer to 

it tiiy a correlation between a i i vie. ; ;u: ;epiibii:iy tc c c 1 c y ii: ; c- < r < < ;i h tieuoty? nd a 
?lotype o jpl ityr e oar present at a candidate locus hypothesized to be associated „■ i " > i r - bill f '/n r>i 
or disease of Interest, or with a phenotype of Interest, the computer-readable program code comprising: (a) computer- 



readable program code for causing a computer to access hapiotype data for the candidate focus for each member of a 
population h 5 i t > i u x <. ^ n x f n r - disease hapiotype data"): (b) computer-readable 

program; i tor causing a compute; to statistically n if the disease hapiotype data to calculate i pc or 
hapiotype pair frequencies: \o) < 7 i m j f r - code j causing a computer > access a database containing 
api ;pe data for the candidate locus for each member of a healthy reference population ("reference hapiotype data t; id) 
computer-readable program code for causing a compu'e .t t v . . . . * . * v «. h. 

' x t, t or hapiotype pair i c ir c (•:;■ > c r x < program code for causing a computer to identify a 

correlation of a haplofvpc o h fir >c^f t -> 1 1 t i, . - i - ' '> 01 , ith the phenotype 

of interest, when the hapiotype or hapiotype pair has a higher frequency in the population having the phenotype condi ■ >n 
or disease of interest than In the reference population. 



80. The computer-usable medium of claim 79, wherein the candidate locus is a gene or a gene feature. 



81 . i he computer-usable medium of claim 79. further comprising computer- readable program code stored thereon for 
causing j or t t 



82. The computer-usable medium of claim 79, further comprising compute'- readable program code stored thereon for 
causing a computer to calculate the statistical significance of the correlation. 



83. A computer-usable medium having computer-readable program code stored thereon, for causing a computer to predict 

eatmeni based on one or more selected haplotypes or hapiotype 
pairs of ih d k ..... . x . ^ is 

a computer to access a dal x c < tc " between haplotypes or hapiotype pairs and responses to the medical or 
pharmaceutical treatment In a reference population; (b) computer-readable program code for causing a computer to locate 
haplotypes or hapiotype pairs in the database that match the selected hapiotype pairs ot the individual, and (c) computer- 
readable program code tor causing a computer to predict the; the Individual's response I be ihe response or responses 
associated in the database with the selected hapiotype or hapiotype pair. 



84. The computer- usable medium of claim 83, further comprising computer- readable program code stored thereon for 
causing a computer to generate an error estimate for the prediction. 



85. A computer-usable medium < i r i fx t o ioi 3 j 1 1 3 computer to 

display a gene's structure and gene feature c ogram cor sing: (a 

computer-readable pir ■ i 1 t 1 A 

display dovi < <t > < i - . . 1 c .ten - ; . ; . , . c r 

groupings of a reference population; (b) computer-readable program code for causing a computer lo retrieve from a 
database data indicative of the gene's structure and gene features: (c) computer- readable program code for causing a 
(.omoulei to -> . p . v . 1 i e 1 

selectable 1 t r indicating tt;e location of gene features, and graphical indicators ot ;he iooailon o; poly morphia slier; <. 
the gene; (d) computer-readable program code for causing a computer to display in a third area of the display device, in 
. l ^ >. v. >. it« the gene 

feature having user- selectable 1 indicating h position ot polymorphic 1 arid ;e) t >lr program 

code tor causing a computer to retrieve from a database, and display In 3 h 1 1 area ot the display device, 1 1 response to a 

t di^ati a o the frequencies within the 

member groupings oft'- x . o' particular nucleotides at the polymorphic site. 



J f pu 1 \ s — ^or a adat le program if 1 ^r e 

display on a display device hapiotype pa-i • . v . 3 >'» tin a population of Individuals, for a selected gene or gene 
feature, the computer-readable p;oorem socio > 1 c r ' < tat computer-readable orooram code for caustno a computer !■ 
display on the display device a plurality of selectable items,' each item corresponding 'to a polymorphic site in the gene or 
gene feature: (c) computer-readable prooram code for causing a computer to retrieve fro- 1 1 I nd display on the 
display device , in response to a user's selection of one or more items Indicating polym rp es, indi dual hapiotype 
p<3 r> in the d« - i> >" md (d) computer-readable program cos 

for causing a computet omc \ 1 0 dis.p!ay J< i i woo he frequencies o he displayed hapiotype pair; 
within one or more member groupings within the population. 



87. A computer-usable medium having computer-readable program code stored thereon, for causing a computer to 
display on a display device polymorphic site linkage data for a gene or gene structure of interest, the computer-readable 
P'xjrarr codec i i oj i :>gram code for causing a computet to display on the; display device 

one or more matrix structures, wherein the axes oi each m f / structure represent the xjL sites In the gene or 

gene feature of interest, and wherein each matrix structure corresponds to a different population or population group: and 
(b) computer-readable program code for causing a computer to display on the display device, in each cell of a matrix 
structure a graft lical tidicat c n 

r til' " t T l/ 



^ r r i ^ v ~ . . . v ^ k j it. 

t -} i f h - 1 i r t J I i ; 

3 r ei a c -j Q ale relating 

89. A computer-usable medium having computer-readable program code stored thereon, for causing a computer to 
display on a display device a phyiogeneflc tree, the computer-readable program cods comprising: (a) computer-readable 
program code fcr oa j< i Xj x oc npu c r to di >o\ » I t i 1 i i polymorphic site in 
the gene or gene ealure of inlck i c am code for causing a computer to display a 
phylogeneli< ee struc v r ach haplotype in a population, where the distance between nodes is 

J'Vt I L ! !' 'Ill ! I 1 . t ^ . I • t V. ^ A ,[X U ; 

haplotypes. 

90. The computer-usable medium of claim 89, further comprising computer- readable pmi. . m s y, ,-d thereon for 
causing a computer to display connections between the nodes that indicate a single nucleotide difference between the 
haplotypes repeserrted by the nodes. 

91 x < sir > , ; i < , iai i ~ i \ . \ r , p > i i f >k I h »i x - t n 

causing a computer to display at each node an indication of the relative frequency of occurrence of the haplotype 
represented by the node among different population groups. 

2. A computer- u ? 1 ^ 1 so 1 hereon, for ca ing a computer to 

L ). I c v. 1 ! v, l_0|f k 

3c i i i j 1 1 code; k causing a computer to display a firs; plurality of ereieroisbie i each corresponding to a 
P'lvinjrj - ^ i i " < < g fo a polymorphic site; (b) computer- 

1 1, oroniam ont. ^ ^ ^ -> ■> im \ \ < ~\ X\ 

le o' g >il odk e elected 
from the first plurality < let hems; arid (c) computei --readable program cods; tor causing a computer to display on 
fn di pi / it c n 1 1 h -r II < \ \ ! \< if bilit> of me assignment to an 

individual of the haplotype pair corresponding to the coordinates of the ceil in the matrix, when the individual is genotyped 
only at the pdymorp 3 ie< taole items. 

93 h computer rsable rnediu i d i i ^ I i on cf reliabili , of haplotype 

pair assignment, and wherein j medium further comprises computer -readable program code stored thereon 'or causing 
3 computer to display a reference color k relating r j to ret:ab:ltty of haplotype pair assignment. 

94. A c < < t 3 . Miputer fo 
display clinical response values ->i r ~c ^r-r ,c a- i j 1 < i !t m >\ r the 
individuals in the population, the computer-readable program code comprising: fa) computer -readable program code for 
causing a computer to retrieve from a computer-readable storage device, data representing haplotype pairs and clinical 
c ;por >c v, ifuc > >r c her 3 nc e i I oie or 

(using ire - i -. >. ■""to ,p- pn r- ■< „ i <~ each of whose cells contains a graphical 
representation of th - xse values or other phenotype data of individuals having the haplotype pair 

corresponding to the coordinates of that cell in the haplotype pair matrix. 

95. A computer-usable medium having computer-readable program code stored thereon, for causing a computer to 
display on a display device clinical response values, or other phnotypic data, of a subject population as a function of the 

i p liv duals in toe population for a gene or gene feature of interest, the computer-readable program 



code comprising: (at computer-readable program code for causing a computer to display one or more first selectable items 
trjjfceenli i c- ^> eature (b) computer-readable program code for causing a computer 

to i t p!a>cri£ n all > - v - 

; display on the- c - - . - 

at ieast one first and second selectable items, a haplotype pair matrix structure, wherein the axes of the matrix structure 
haplo /pes n he gene or gene feature of interest that vary at the polymorphic sites corresponding to the first 
elected iter f -\ i ti n > !i i matrix con N area o i nlcs 

i r i! r r ifcn of 

3 h i the hap! hi g t a so d type pair matrix. 

96. The computer- usable medium ! i "i "4 r I jr -> il 

response value, or other phenotype data, and wherein the medium further comprises computer-readable program code 
\,< '\ x s a i n i i >. - >. k i . 1 m> ° alue. 

r r i iii la i n r n it comonses {a) computer-readable program 

code stored (hereon for cauamg a computer to display a means for adjusfmg ;he range of mean clinical response values; or 
other pin ri i data represented by the reference color scale: end \h) compute: -readable program cede stored thereon 
for causing a computer, in response to the adjustment of the range of clinicai response values or other phenotype data 
represented by the reference color scale, to adjust the color of the cells of the hapioiype pair matrix. 

98. The computer-usable medium of claim 94 or 95, wherein the graphical representation of data is a histogram inc j 

stribution of individuals across the range of clinical response values o ■ x JcU 

99. The computer-usable medium of any one of claims 94,95. or 96. wherein at least one cell in the displayed matrix 

t ' . . ' er comprises computer-readable program code stored thereon 

i i i ! n i c h jr. a c -ted by ft f il !! in the 

matrix, a histogram indicating the distribution of the individuals across the range of clinical response values. 

I" 'tori,! r , r computer-readable program 

code stored thereon for causing a computer to display a third selectable item, and computer-readable program code 
stored then > ^ electable item by the user, the 

statistical significance of e correlations between var:af:or: af c . c * I polymorphic i < and the i ti A response 
values. 

101. The computer-usable medium c any one of claims 94.95, or 95, >,vh:ch t comprise:; r it t if program 
or er t n v j>i gaconpuerto i i > j c -fable tern anc It orogram code 

stored thereon for causing a computer to display, in response to selection of the fourth selectable item lay the user, the 
u k rl i ^ >. i ii rl i the 

102. The )tr !k j 3 medium of any one of claims 94.95, or 95, which further comprise:; computer -readable program 

p^e to is->r> 3 i ' and computer-readable program code stored 

thereon for causing a computer to display, in response to selection of the fifth selectable item by the user, the results of an 
i lj i t i iJ> he h°i a il n l a i" onse values between 

. , - 103 i\ compute usable medium having compute - 

readable program code storeo trmioo or i,'*'/? ri r t m \ U it o j| so; of 

weights to fit a function of polymorphic site data for a gene or gene feature of interest to a clinical response measurement, 

.aol 43 orogram code for causing a computer to display 
a variable controller for setting the number of genetic algorithm generations pars-meter; ib) computer-readable program 
i 1 t i i t if r r r r i r) computer- 

. a computer to display a variable controller for setting the mutation rate parameter; rdt 
computer-readable program cod- arcaus g a computer to disp iafc I i t fi ra 

i r 1 c 4 la r >r >chct3ok I cms each 

corresponding to a pol r t n if - <■ 

causing a computer to displaying a selectable item for initiation of the genetic algorithm calculation: and (g) computer- 
readable program cede for causing a computer, In response to the selection by the user of one; or more selectable items 



corresponding to a polymorphic site, and selection by ihe user of ihe item for initiation of the genetic algorithm caciuiaticn, 

0 „v <. i h ."ii" I'n. i hi' L„ t t 1 1 Lj hi. m t. ^ i >. . 

\iik (i) lit residua! error o' h model as a in n 1 the number o; genetic algorithm generations, and mi the resuHs 
- 3/ ik - 7 il il -,,ii - or t ghts for each of the polymorphic sites. 

1 A A ;ompu usable mod r a\ } ootnput ; la! ie prog n cods I ;d theteo 1 ' ismg a oompi if) 
display on a display device correlations between clinical outcome <. ak ->t . - -I m : .• oi ^ tumjik 

j olo^t f ^ i ~f -> h oiogram code for 

causing a comoute' to UoplaN L le tens conespondi ^g to clinical outcome measurements; 7) (b) 

m it c 3 t fx able lie ns corn: sp )i 

ii i I f ii i i i i h itk 

plot of data points, each data point corresponding to an individual in the selected population; 9) (d) computer-readable 
(. ) jr an "ode c ' an a ■•: a "ompuier, in response to selection by the user of an item from among the first plurality of 
el< tab! i £ n> > < t nr vaku k 

hr oi t d in i i t 1 i c k 

program code for causing the computer, in response to selection by the user of an item from among ihe second plurality of 
ilntk i c t =>! > f m< aic pk i i ii I, clitiii a! c Jtcome value; for 

the associated individual from the cl iii' - , — 

105. A computer-usable medium having computer- readable program code stored thereon, tor causing a computer to 
provide information of use in conducting a clinical trial of a treatment protocol for a medical condition of interest, the 
computer-readable program code comprising: (a) computer-readable program code for causing a computer to access a 
database of DNA sequence data for selected genes or other loci in a reference population oi individuals, and to access a 
database of i) t i l < i I \ 5 ■> > c he i > i i I i i puiation of 

i -d vk i,;h, ;fc n \ t *'it fiuin ' n v x for causing a computer to assign to each member or the reference 

popuht hi v. -t "e- c or a - p n 

computer to calculate the itPt l ani n Ii it m 

each o! the assigned haplotypes n the reierence population; id; oompuier-readabie ptogrcm code tor causing a computer 
to assign to each member of a trial population 1 1 - t t er loci, based upon thi 

frequencies, population distributions and statistical measures calculated In the reference population; (e) computer- 
readable program code for caus t . . v. v > - i 
treatment and Individual haplotypes, for each of the selected genes or other loci; (f) computer- readable program code for 
causing a computer to accept as input an individual's DNA sequence data or haplotypes for one or more of the selected 
genes or other loci, and (g) computer-readable program code tor causing a computer to display or output i expected 
response i the individual to the r tit based on the determined correlations between i ii i responses k 
treatment and Individual haplotypes. 

108. Thecorr o; claim 105 i imp i >l nam code stored 

thereon for causing a co hi r k j J i J t 

of genofyt ^ i t! - ^ dm " " t - a complete 

' .. c computer-readable program code stored thereon for causing a computer to use the 

;duc enemy? i 

107. A compute i u' rsedium g c A I hereon, ft irtg a computer jo Inf 

t polymorphic sites, the computer-readable program 
codecomprising: (a) computer-readable program code for causing a computer to access a database of m-site haplotypes 
of the selected gene from a repre e a - ^ " r ^ arising a 

oornpul i I a at i , , . N . . i - - c if i r 1 ' : ii.fi 

causing a computer to construct a lis; of all genotypes lhat could result from all doss bk pai of observed h pk /pes j 
computer-readable program code for causing a computer io calculate the expected frequency of these genotypes 
at -r ' j 1 1' ;g:am code tor causing a computer to generate a 

i 1 i n r •< ■< cj Till f of the 

nucleotides at m-n polymorphic sites and admits the identity of nucleotides a; the ot^cr n site < or- >r 1c readable 
program code for causing a computer to for calculate, for each mask, how much ambiguity results from genotyping with 
only the n poly t -y is admitted by ihe mask; (g) computer-readable program code for causing a 

computer to >, ft < device the calculated ambiguity for one or more masks. 



108 T 9 comput t -^J i r slaim 10 whic irther comprises computer-readable program code stored thereon 

for causing a computer to calculate the level of ambiguity for a mask, the computer-readable program code comprising, (ai 
> ) c it i c at p oc, : v code for ; r . \, a xiqi ' o c nf S aii pairs c genotypes ; are c x amd dent cal by 

, I it T l 3 ^ Sit i it i < i r I 3t n r Jl i Jt - j k X "I ' u ir 3 "i 

the calculated Hardy- Weinberg frequencies of each pair of genotypes rendered identical by application of the mask; (c) 
computer-readable program code for causing a computer to sum a!! such geometric means for all ambiguous pairs to 
obtain an ambiguity score for the mask. 

109. The computer-usable medium of claims 107 or 108, which further comprises computer-readable program code stored 
he ex Met rig ompul k >e ? i 1 ambiuuc xtoiype, the con uk 
readable program code . i i o >t > 1 i it ! r i r i causing a comnuk-r So calculate for two 

po pairs A and B that could Pxoiain a 9 ve' 1 fjeno , - r J e PA and PB, 

it P L \ f [■> 1 t >. 
comon -> n 3 i c 5 < ji il k P<\ 

nn i i he number eater than PA. assigning the haplotype pair B. 

1 1 0. A computer-usable medium having computer-readable program code stored thereon, for causing a computer to 
d ie n le pof^ - tilotypes ;ha aie with a clin spense c est, or othe 
phenotype, the computer-readable program code comprising: (a) computer-readable program code for causing a 
computer to access a database containing haplotype ; c n and clinical response or outcome data -conical outcome 
values) or other phenotype data, from a cohort of subjects; (b) computer-readable program code for causing a computer to 
statistically analyze each individual SNP in the haplotype for the degree to which it correlates with the clinical outcome 
values or other phenotype data, and generating a numerical measure of the degree of correlation; (c) computer-readable 
program code for causing a compute; to stote i further processing these individual SNPs ariose n c ! < tin of 
the degree of correlation with the conical outcome values or other phenotype data exceeds a first cut-off value; (d) 
computer-readable program code for causing a computer to generate all possible pair-wise combinations of the saved 
SNPs so as to provide a set of n-site sub-haplotypes where n = 2; (e) computer-readable program code for causing a 
computer to statistically srisiy.se sash ' generated n-slk: sub-hapfoiype tor the degree to which :t oorreistes with the 
clinical outcome values or other p motype ; jte ; " vih"", » of the degree of correlation; (f) 
computer-readable program code for causing a computer to store for further processing those n-site sub-haplotypes 
whose njm it • . . ot me degree of correlation exceeds the first cut-off value; (g) computer-readable program code 
sot causing a computer to generate aii poss )ie pa >e combinations among and br v i J i x NPs and saved 
sub-haplotypes, to produce new subhapioiypes with increased values of n; (h) computer-readable program code for 
causing a compute to tpe t ^ <. j rs t i f i j 
further sub vjIIji nm x -er<\ta 

111. The computer-usable medium of claim 1 1 0, which further comprises computer-readable program code stored thereon 
tor sausmg a computer to display those saves' SNPs and sub-haplotypes whose numerical measure of the degree of 

oirt 5 i io with the clinics;! outcome value or other phenotype exseods a second out ■ of: value, wherem the second i off 
value is greater than the first cut-off value. 

1 ! <- "> 1 1 I x i x npu oi 

determine pok i ? j i s - r sterest he 

p n >tv>ff hf if i i program socio comprising, (a) sompuier- readable program cork- for causing 

computer to access a database containing haplotype Information, and clinical response or outcome data (clinical outcome 
values) or other phenotype data, from a cohort of subjects; (b) computer-readable program code for causing a computer to 
statistically analyse each individual SNP m the hapiotype for the degree fo whfch it correlates with the clinical outcome 

i table program 

code for causing a romo < t r , « x c , > -j 1 - I- h,,; i ir degmeof 

> ■■>■ ' itation does not exceed a first cut-off value; (d) computer-readable program code for causing a computer fo generate 
all possible pair-wise combinations of the saved SNPs so as io provide a set of n-site sub-haplotypes where n = 2; (e) 
computer-readable program code for causing a computer io statistically analyse each newly generated n-site sub- 

5 relates with the- clmicai ou . - ^ , , x x ~ ~ ; 
p-value for the degree of correlation; (f) computer-readable program code for causing a computer to store for further 

- . ceo whose p-value for the degree of correlation does not exceed the fitst cut-off value; 
(g) computer-readable program code for causing a computer to generate all possible pair-wise combinations among and 
between the saved SNPs and saved sub-haplotypes, to produce now s - > vi h increased values of n ; (hi 

computer-readable program code for causing a computer io repeat steps (e) through (g) until either (i) no new sub- 



i pes can be generated, or (ii) no further sub-hap!oiypes having n less than a pre-selecied or user-selected limit can 

I. y-J ;', .tOl 

113. The computer-usable medium of claim 1 t 0, which further comprises computer- readable program code stored thereon 
n.ii iv j r i jte saved SNPs and sub-haplotypes whose p-value for the degree of correlation with 

the oimioai outcome value or other phenotype does no! exceed a -second cut-off value, wherein she eocene cut-off value Is 
less than fee fire; cut-off value 



114. The computer-usable medium of claims 1 10-113. which further composes computer-readable program code stored 
•hereon for causing a computer to exclude from further processing complex subhapio types which are constructed from 
smaller sub-hapiolypes, where the smalle; euo-haplolypes each have correlation values thai are at least as iy< i (I as 
that of the complex sub- hapiotype. 

115. A computer-usable medium having computer-readable program code stored thereon, for causing a computer to 
< l < < i. ,1 ii i hoi 

p not p of interest, the computer- readable program code comprising: (a) computer -readable program code for causing 
a computer to access a database on nir ^ ^ - ^ 

response, oureeme Para, o? other phenol ype data from a cohort et sniijecfs; tip computer-readable program code for 
causing a computer to statistically analyze each single gene hapiotype for the degree to which a correlates with l he clinical 
response, outcome, or phenolype of interest, and to generate a numerical measure oi the degree cf correlation; (c) 
computer-readable program code for causing a computer to store for furthei processing 

numerical measure of the degree of correlation exceeds a first cut-off valur <. .r ' ,. Jable program code for 

in. 1 . ^ ^ x . >ub luploiypes having 

a single site masked, so as to provide a set of m-n site sub-haplotypes vvhoic n = 1 c- jmpuU ; iable progiam code 
for causing a computer to statistically analyze each newly generated sub-hapiotype for the degree to which It correlates 
with the clinical response, outcome, or phenotype of interest, and calculating a numerical measure of the degree of 

t UK I 3 I I 1 I I t f t I >< J0-- 

hapiotypes whose numerical rrcasu e of the dcgrer ot r ^ t i ;u n ) renpeioi -readable 

program code for causing a computer to generate, from the saved sub-haplotypes, all possible sub- haplotypes having 
k ti ) 3 . , . . e iuuqh (g 

until either (i) no new sub hapk v, ^ t a it s n i ik lurtlK 

sub-haplotypes having more unmasked sites than a pre-selected limit can be generated. 

1 1 ">n t i la! m 5 ther cot omputei oacath p ogram < t I 

or causing a cornp >h teletypes whose numerical measure of the degree of correlation with 

!m i " >. „ ^ >- a -t v <- j » ... 

i e v £ ;a1< thai he fn t ;u! oft value. 

117. A computer-usable medium ha nui'omout lets puter 

cete nine p jl/nc f h c . n f r t s hi 

lie program code for causing 

a computer to access a database containing single gene hapiotype Information for one or more genes, and clinical 
r. 4 N >< r. r i - i i t . . 1 i , v . . c-gram code lot 

s h single gene haplo ,\ - ,,<■', • . nnh , ' t \ ' I w 

response, outcome, or phenotype of interest, and to calculate the p-vaSue for the degree otcoi relation; (c) computer- 

1 v . causing a computer to store for further process g thos apiofypes ese p-v or the 
dec c on doe xceed ' aiue: (d) come iable program c< using a comput 

"> "> v - ^ ■> ' I \ II I i ! f 

so as to provide a set of m-n site sub-haplory pes where - S . (e) computer-readable program code for causing a 

mi Jt I j t t ij. j i k hi c ales with the clinical 

response, outcome, or phenotype of ink re j t la r omputer- 

^ ■> a is ng a computer to save for ?urth« ? 

^ o f 3 ■= yi ~~ipi <=> e- 1 j - - n » 

to generate, from the saved sub-haplotypes, all possible sub- haplotypes having one additional site masked; (h) computer- 
sod* k cau '33c >e: jter tc repeal steps ■!«) ihroug 1 fgj r pioiypes ha\ 
a p- value which does no; the first cut-off value, or (ii) no further sub-haplotypes having more unmasked sites than a pre- 
selected limit can be generated. 



118. The comput sa ^ 1 < r lalm 11 » k.rther comprises computer-readable program code stored thereon 

o ! v I -> - t t. <. h,"k , =■ <. 

response, outcome, c?r ohenoiype ! interest does riot exceed a second cup off value. wherem ihe second out-oli it c i 
less than the first cut-off value. 



thereon for causing a computer to exclude from further processing complex sub-hapiotypes which are constructed from 
smailei -irt c " & cc relation values that are at least as significant as 

that of the complex sub- hapioiype. 

120. A computer progran ned to >. i < Oe made to an individual mei i t 
whose genotype information for a gene or gene feature of interest is stored in a computer-readable form, ihe computer 

c >sc r or 

executing the program code stored in memory, wherein the program code includes: computer-readable program code for 
causing a computer to generate all possible haplotype paii£ con£ i >k 

program code tor causing a compile; to c:aicuia;e ihe frequency of the hapiolypes and hapioiype pairs according So ihe 
Hardy- Weinberg equilibrium, based upon the ob$o'< xi c i.t i on > n 1 1 > ^ o. lo i ji - ">ui - < d 

computer-readable program code for causing a computer to select the most probable haplotype pair for the individual. 

121 . The computer of claim 1 20, wherein the program code further includes computer- readable program code for causing 
a computer to correct the stored distribution of haplo types or haplotype pairs for effects imposed by the presence of a 
limited "unt l j hi the population. 

122. The computer of claim 120, wherein the program code further includes computer-readable program code for causing 
a computer to validate haplotype pair assignments by analyzing for compliance of the assigned haplotype pair with 
Aende t - p!< s 

1 ] < nr i ik ! 1 1 1 i pulsion, a 

clinical population, a disease population, an ethnic population, a family population and a same-sex population. 

124. A computer programmed to cause haplotype pair assignments to be made to an individual member of a population 
whose genotype information for a gene or gene feature of interest is stored in a computer-readable form, the computer 
comprising a memory having at least one region for storing computer executable program code and a processor for 

causing a computer to generate at: possible hapioiype pairs consistent with the stored genotype, computer- readable 
program rodt i j I r p J i i 

determine fron ipe j ? 3 se p at ihe indi kial h 

possible hapic x>mputer-readabie p jgram cc r ca - c 1 k i the most probable 

haplotype pair for the individual. ' 

125 \ c 1 r £ grammed k - < i < ' - In 1 ,<; v i a 

v v v v •> - ^d with the clinical response other 

phenotype, the computer comprising a memory having at least one region for storing computer executable program code 
and a processor for executing the program code stored in memory, wherein the program code Includes: (a) computer- 
readable program code lor causing a computer k access a daisbase containing dak; < cilmcai responses to treatments, 
or other phenotypes, exhibited by individuals in a clinical popula on c i rogram code for causing a 

computer to access a database containing haplotype data for each individual of the clinical popula; on, the hapioiype data 
comprising information on a plurality of polymorphic sites present at the candidate locus; and (c) computer-readable 
program code for causing a computer to caicuiaie ihe degree of correlation between hapio;ypes o; hapio;ype pairs and she; 
clinical csp r r - r ~ t til 1 ! h f k s n t 

128. The com? 1 serein th* nent comp i tlon c a drug or drug candidate. 



127. The computer of claim 125, wherein ihe candidate locus is a gene or a gene feature. 



1 28. The computer of claim 1 25, wherein the program code further includes computer- readable program code for causing 
i ur\ 1 i store, dupiay or output h degree 1 rone! ahem 



129. i no computer of claim ! vvhereln the program code further includes computer-readable program cede for causing 

ert I i tafisticaf significance of the cc 'elation. 

130. A computer programmec te ide~ i eaiu ^ 

interest, or other phenoiype, and a hapiotype or hapiotype pair present at a " , to ^ hypothesized to be associated 
with u c ,. nil to the condition or disease 1 interest, or whh a phenotype ot interest, t t \ i i r n s 
memory having at least one region for storing computer executable program code and a processor for executing the 
[ r r i >ioo >< > , t ^ro i i f ^ i ^ id' i 

computer to access hapiotype data for the candidate locus for each member of a population having the phenotype or 
i I ii i a i xtobk program cc c > c t 

a St - ~\ } zr 1 — - _ r p U j f 

i. A jijhi f n,ng hapiotype da a <. t l c ?i Hid ite locus for 

Mo I' < h i lu . ' N ^ . ^ \ . . < 1 

u in ->i t t i r,|| tit n 3p!otype data to calculate hapiotype or hapiotype pair 

frequencies; and (e) computer-readable program code for causing a computer to identify a correlation of a hapiotype or 

irbUj. > h i } \ ik „ . ! >. t ^ h 5 the 

hapiotype or haptolype pair has a higher frequency In ;he population having ;he phenotype, condition or disease of interest 
than in the reference population. 

131 . The computer of claim 130, wherein the candidate locus is a gene or a gene feature. 

132. The computer of claim 130, wherein the program code further includes computer-readable program code for causing 
a computer to store, display, or output the identified correlation. 

133. The computer of claim 130, wherein the program code further includes computer -readable program code for causing 
3 i I 1 o' he correlation. 

134. 'ill programmed to predict an mdividuat's response < a medical or pharmaceutics; i en ! based on one 
or men t i r k i r 

on o _ on r. -> , -> ij rno^^i - , m memory , 

wherein the program code include ; . ..ter-readable program code for causing a computer to access a database of 

correlations beiwecn haplotyp es or ' jototype pji t < i t it iia 

reference population: {b) computer -readable program code tor causing a computer to locate haplotypes or hapiotype 
in the database that match the selected hapiotypes or hapiotype pairs of the individual, and (c) computer-readable 
program cj i <_ 1 k. * ^ . v - f 1 x =■ <. y < ... 

associated m the database with the selected haplotvoe or . 0, a pair. 

f 35. The computer o n 134, w ; c r ^ - t , - , , > ia > , 

a computer to generate an error estimate for the prediction. 

13 b. A computer programmed to d:splay a gene's structure and gene features on a display dev:c.e. the computer 

cr^ i -c-ou^ cereln e u fw" ~m -o i ! ice c r ram code 

for causing a computer to retrieve from a database, and display in a first area of the display device, data indicative of the 
frequencies of ■occurrence of a gene's haplotypes wthin preo'etui mined member groupings of a reference population; { t ; ) 

structure and gene features; (c) computer- readable program code for causing a computer to display In a second area of 
the display device a graphical representation of the gene's structure, user-selectable' items indicating the location of gene 

N , o - the location of polymorphic sites on the gene ,'c , m, 1 ' am code 

for causing a computer to display In a third area of the display device, in response to a user's selection of an item 
indicating a gene feature, a graphical representation of the structure of the gene feature having user- selectable items 
indicating the position of polymorphic sites; and (e) computer-readable program code for causing a computer to retrieve 
from a database, and display in a third area of the display device, in response cause s selection of an Item indie; 



the position of a polymorphic site, data indicative of the frequencies within the member groupings of the occurrence of 
sarticul \, A it 1 om \ 

137. A computer programmed to display on a display device hapiotype pair frequency data within a population of 
individuals, for a selected gene or gene feature, the computer comprising a memory having at least one region, for storing 
computer executable program c -c ; 3 ; processor k executing . program code stored in memory, si he 
program code Includes: (3) ji i i it i program code tor causing a computer to display on the display device a 

selectable Items, each item corresponding to a polymorphic site in the gene or gene feature; (c) computer- 
readable program code for causing a computer to retrieve from a database and display on the display device, in response 
ills i I fi t differ 

a: one or more of t selected polymorphic. sites: and 1 ii computer readable program cods; tor causing a computer to 
1 ph - ^ -L Ja - 1 =J hapiotype pairs within one or more 

member groupings within the population. 

138. A computer programmed to display on a display device polymorphic site linkage data for a gene or gene structure of 
interest, the; compute; composing a memory having at leas! one region for storing computer executable prog mm code and 
) |/oc I <> v . - v , < '1 1 "a m jr ±\ > 
program code for causing a computer to display on the display device one or more matrix structures, wherein the axes of 
each matrix structure represent the polymorphic sites in the gene or gene feature of interest, and wherein each matrix 

ti! 010 C f. k I f ( 1 i f ( i , > . lid ^ I ) I . d I . It I C 31, II 

computer to display on the dtsp hv Hevco - or ,h x a 1 r - ix ; c m , ' x sjioe of linkage 

between the twp polymorphic sites corresponding to the coordinates of the eel! in the matrix, 

M ' X MP \l . , " ■■ ■■ ^ \ v i i 

medium further comprises computer-readable program code for causing a computer to display a reference color scale 
relating color to degree of linkage. 

140, A computer programmed to display on a display device a phylogenetic tree, the computer comprising a memory 

3 ■■etc gram code 

stored in memory, wherein the program code Includes: (a) computer- readable program code for causing a computer to 
display a plurality of selectable Items, each corresponding to a polymorphic site in the gone or gene feature of interest; 

^ jrugram code for cai . ^ " . fi g a node for 

each hapiotype in a population, where the distance between nodes ;s proportional 1 3 the minimum number of nucleotides 
hat would I 3 to fx 

14" 3e computer of claim 14 1 ible program code for causing 

a computer to display connect , t <- r < t m t 3 1 r 

repesented by the nodes. 

142. The computer of claim 1 40, wherein the program code further includes computer-readable program code tor causing 
a computer to display at each node an Indication of the relative freqi e of slotype repre J. 
In 1 U r t ^ v. 

143. A computer programmed to display a genotype analysis screen on a display device, the computer comprising a 
memory having at least one re b 1 jtmclno 
program eocie stored in memory, wherein the program code includes: {a; computer-readable program code tor causing a 
computer to , > -> v <mvh m - - - ~ f „ m ^hi^ " t - i a second plurality of 
selectable . " lb) computer j ' . j „ ' 

to display on the d!=.oh/ 1 s < r i t it t nt h I > /oe> in the 

gene or gone- feat, , - 1 -st fhat vary at the polymorphic, sites selected from the first plurality of selectable Items; and 
(c) computer-re npu er to display on the display device, in each cell of the matrix 

stru uie a v gn rwnt to an individual of the hapiotype pair corresponding to 

the coordinates of 1 he cell in the matrix, when the individual is genotypod only ai the polymorphic sites selected from the 
second plurality of selectable items, 

144. The computer of claim 143, wherein color is used as the graphical indication of reliability of hapiotype pair 



assignment, and wherein wherein the program code further includes computer-readable program code for causing a 
if. l l.' , t i w ^ „ 'i >, iity of haplotype pair assignment. 

145. A computer programmed to d 3 < i esponse values, or other phenotype data, of a subject population as a 
function of haplotype pairs of the individuals in the population, the computer comprising a memory having at least one 
region for storing computer executable program code and a processor ior executing the program code stored in memory, 
wherein the program code x ;des > con 3uk dab jm ? ;o g a compi 3 ve from a 

at sto age device, data representing haplotype pairs and clinical response values, or other phenotype 
data, for the subject population ax h if t c i. ' j. i b 

f i a v llx i A i K h , . N ^ C > Oon C „dllf^ 3 

other phenotype r j ,\ individuals having the hapblype pair co: responding to ihe coordinates ol tiist !! in the hapbfype 
pair matrix. 

146. A computer programmed to display on a display device clinical response values, or other phnotypic data, of a subject 

s t. bt»pe pairs o* the individuals n the population for a gt^e o k t < i teiest 

the compu er oomph I ode and a 

processoi or executing he proo rice c , < < herei the program code includes: (a) computer-readable 

program code fo n ti 3 ;lect; ?ms representing p lymorphic sites of the 

gene of gene feature; (b) computer-readable program code for causing a computer to display one or more second 
electab! ns i i j clinical i ements or phenotypes; and (c) computer-readable program code lor causing 
a computer to display o | , - 1 1 i i r | 3 n i 

selectable items, a haplotype pair matrix structure, wherein the axes of the matrix structure represent haplotypes in the 
gene or gene feature of Interest that vary at the polymorphic sites corresponding ie Ida tlrst selected hem or items, and 

< Kh t I! Il l i U 111! 3 I HI lllli I >3th< i 

pe data, for the clinical measurement represented by the selected second item, of indivk 
pair corresponding to the coordinates of the cell in the haplotype pair matrix. 

147. The computer of claim 145 or 146, wherein color is used as the graphic I ic ol nean clinical response value, 
or other phenotype data, and wherein the program code further includes computer-readable program code for causing a 
computer tot 4!), , un xf!,t - < c )r to moar- clinical response value. 

148. The computer of claim 14 ' £ n cc npi er-readabie program code for 
tan 11 4 t nr > il f , , ' - 1 

n c ntt r hj fhr 1 , - - - t < 1 i i c 1 ipi i in 1 c i n 

istment of the range of clinical response values or other phenotype data represented by the reference color scale, 
to adjust the color of the ceils of the haplotype pair matrix. 

149. The computer of claim 145 or 146. wherein the graphical representation of data is a histogram indicating the 
distribution of individuals across the range of clinical 'etpc * lx .a 

150. The computer of any one of claims 145,146, or 147, wherein at least one ceil in the displayed matrix includes a 

r 1 3clt ? ; X; lor cauemg a compu 

<• > r ,r r r n > histogram 

Indicating the distribution of the individuals across ihe range of clinical response vaiues. 

151. The computer oi any one of cteims 145,146, or 147 wherein the program code further nci - r I 

1 n code for causing a computer to display a third selectable item, and computer- readable program code for causing 
^ ^ . "c ion of the third selectable Item by the user, the statisticaf significance of the 

correlations between variation a: individual polymorphic slies i the clinical response values. 

152. The computer or any one of claims 145,146, or 147, wherein the program code further includes computer-readable 
program code k , \> 3 mpr v lt.>b >t! V - I i ; ; , -> ~ , ► b orogram code for 

f lay, in response to selection of the fourth selectable Item by the user, the numerical mean and 
standard deviation of clinical response values among individuals having each haplotype pair In the matrix. 



153. The computer of any one of claims 145,148, or 14 7. wherein the program code further includes computer-readable 

°ctable iten and computer-readable program code for causing a 
compul "■ c . >f I i i" c ii . c °i< H ' h i 1 m . " ^ . i 

lil > p 3 I malica of whet iation I il respo v-alus - .-ngd 

hapiotype pairs is statist 



11 » r nm I II I i i 1 - n i -■ . m i • > ' o I 

polymorphic site data for a gene or gene feature of interest to a clinical response measurement, the computer comprising 

i i' h-, 'i . " . . - .me executable program code and a processor tot executing the 

prograrr (.odes tort' mm, . c * N . m f. . ^ 1 c < v >it ; < 

;o nt it o r r n it; - . , ' . m , > , > v r jn oj|f 

- computer to display a variable corsiroile: foi setti c ■ c 

(c) computer-readable program code for causing a computer to display a variable controller for setting the mutation rate 
. > r i i if i i ( i n j >r „ - !< t. i ,< tir 3 the 

clo mogram code for causing a computer to display one or more selectable 
items each correspond:^ to a polymorphic site of the gene or gene feature of Interest; and (i) computer-readable program 
code ku in ii, a computer to displaying a selectable item for miration of the genetic algorithm calculation: arid (g) 
compute! u < > >f i omputer, in response to the selection by the user of one or more 

^elects r - - - - n i ^ ^ en if t-e genetic 

. , I - n i > 1 . t . i i and 

i display on a display i ill the residua! error of ;he mods:! as a function of the number of genetic algorithm 
generations, and fib the results of Ii genetic algorithm calculation: showing the optimal weights for each of the 
polymorphic sites. 



155. A computer programmed to display on a display device correlations between clinical outcome values obtained from 
selected clinical outome measures for a selected population, the computer comprising a memory having at least one 
region for storing computer executable program code and a processor for executing the program code stored in memory, 
,<vh< - e program >d< c dee: 1 1 3) compute eadable program code for causing a computer to display a first 
plurality of selectable Items corresponding to clinics! outcome measurement i.m„ - c i -> program code for 

causing a computer to display a second plurality of selectable Items corresponding to clinical outcome measurements: and 
13) (c) compute! lear \. y t r t j >. nt each data point 

corresponding to an individual in the selected popula ion 14) tdi c -> > I 'ii im i 

compute! hi \ I ie user of an item from among the first plurality of selectable items, to locate each 

data point along the \ \ . i ^ a! outcome value for the associated individual from the 

dni! il n i it rt t I t i i f i iu 113 the 

computer, in response to selection by the user of an item from among the second plurality of selectable items, to locate 
each data point along the y axis of the scatter plot according to the clinical outcome value for the associated Individual 
from the clinical measurement represented by the selected item. 



158. A computer programmed to provide information of use in conducting a clinical triaf 
medical condition of interest, the computer comprising a memory having at lea ne regio c omputer 
l i it ..... : i. i o l 1 > mo ii< -v . n e piog m xer 

- ^ . N - - 1 . . ' ' . i t t i , database of DNA sequence data for 

select genes or other ic cess a < rse of (or accept as input) DNA 

sequence data for selected g >. r il t n i 1 k pro i 

ode k i if. i > c 3ch it the eke tee 

genes or other loci: (c) computer-readable program code for causing a computer to calculate the frequencies, population 
distributions and statistical measures, Including confidence limits, for each of the assigned haplotypes in t reference 
population ili omputer-readab! program cod i 3ch membei of a iual poputaii 

haplotypes for each of the selected genes or other loci, based upon the frequencies, population rim 3 statistical 
measures calculated in the reference population: (e) computer- readable program code for causing a computer to 
< <. I j. e_ o ^ t t eat ie ,f and individual haplotypes, for each of the 

selected genes or other loci: ft; computer-readable program code for causing a computer ;o accept as input an individual's 
DNA sequence data or haplotypes for one or more of the selected genes or other loci ' - - - c readable program 

code for cau - play o; output the expected response of the individual to the treatment, based on the 

determined c Jua ee^o t_ he eatme t and ndi dual apiotypes. 



157. The computer of claim 156, wherein the program code further includes: (a) computer-readable program code for 



causing a compute to o t - ( - v x^ a ^ i fx ^ l p - ^ ^ ^ ^ ^ 

genotyping mart ; whit aitow an dividuai's haplotypes to be accurately predicted without conducting a complete 
molecular haplotype analyax: and ;b; computer-readable program code for cauaing a compute; to use Six:- reduced set q\ 
genotype ma e c ask h pk x - 



158. A computer programmed to it genotypes oi n ii t I subjects; k a selected gene having i leas; 1 I it 3tf 
sites, the computer comprising a memory having at teas! one region for storing compute; executable program code arid a 
processor for executing the program code stored in memory, wherein the program code includes: (a) computer-readable 
program code for causing a computer to access a database of m-stte hapiotypes of the selected gene from a 
representative cohort o; individuals; ;b; compute: -readable program cede lot cauaing a computer to tabulate the frequency 
ti r lii! tiurj a list of 

-X i aufwl-mu - - aotvi-r^ mi ->ipu <x o~„ u; inmoojofn 

causing a computer to calculate the expected frequency of these genotypes assuming the Ha t. v ^ .an u 

computer-readable noxxi. >d^ >rca. c t. 'UJt > x etc c c - >c < c c die x A\> iri»k' offhf same 
length t as the hapiotypes, wherein each mask blocks the identity oi the nucleotides at mm polymorphic sties and admits 
the Identity of nucleotides at the other n sites; (!) computer-readable program code for causing a computer to for calculate, 
tot each mask, how much ambmuity resutts from gene-typing with only the n polymorphs s:les whose identity is admitted 
yy ' ' ~g a computer to output or display on a display device the 

calculate \ a 3 more masks. 



159. The computer o; claim 158, wherein the prog-am code further includes computer- readable program code tor causing 
a computer to calculate the level of ambiguity for a mask, the computer-readable program code comprising: (a) computer- 
readable program code for caus tig a ^ rr er to i itify all pairs of genotypes that are rendered identical by application 

ih n > tt 1 r if t 1 i tr n f Hi 

calculated Hardy- W-^ x c s of each pair of genotypes rendered identical by application of the mask; (c) 

computer-readable program code tor causing a computer to sum a!! such geometric means for all ambiguous pairs to 
obtain an ambiguity score for the mask. 



p icjiair i k r if m - ' j il < - 11 it,fx In 

nitl jilt I K i ! 3!- f r t I 

01 t> 'o k mo f I " - PA and 

PB, where PA •- Pt3 ! . L computer-readable program code tor causing a computer to assign a haplotype pair by a 

» 1 an or equal to PA, 

assign m ^ N v , xi ir \ir \ r > c, eater than PA. assigning the haplotype pair B. 



161. A computer p . t h< >u It i ponse or 

outcome of interest, or other phenoiype the comp c « j 

computer executable program code and a process -> .r code stored In memory, wherein the 

program code Includes: (a; compute; -readable program -code for causing a computer to access a database containing 
haplotype ink i x x >r c >r I 1 p <. t o< oda rem a 

<- j ho rj 1 <- r it 11 1 t it t i I r jjj m it, 1 Ju )5 

SNP in the haplotype for the degree to which it correlates with the clinical outcome values or other phenotype data, and 
generaimg a numerics! measure of the degree oi correlation; _. computer-readable program code fx causing a computer 
to store for further prom m N , > a numerical measure 0 t r 10-ivth the 

clinical outcome values or other phenotype data exceeds a ut-o value comp t am r 

causing a c > erate ail possible pair wise combinations of the saved SNPs so as to provide a set ot rxslte sub 

haplotypes where 1 ^ a; (et ii 1 program code to; causing a computer to statistically analyze each newly 

<- - - f \ -> - x , x ^ ■> v , tie ci i<- o X^cr phenotype 

data, and calculate a numerical measure of the degree of correlation; (?) computer-readable program code for causing a 
computer to more for further processing those n-site sub-haplotypes whose numerical measure of the degree of 
oorrela ion exa* 1 5 firs cut >f Jt iue c put £ 1 5 n code for causing a comput 0 1 1 t =3 
possible pair-wise combinations among and between the saved SNPs and saved sub- haplotypes. to produce new 

crea d vah oi n w ompute readable program code for causing a computer to repeat steps 
N ugh (g) until either (I) no new sub-haplotypes can be generated, or (ii) no further sub-hapiotypes having n tess than 
a pre-seleoted or user-selected limit can be generated. 



162. The computer or claim 161, wherein the program code further includes computer-readable program code tor causing 
i tepl ho i SNPs and sub-haplotypes whose numerical measure of the degree of correlation with 

im c c ai outcome a - or > c i y >t ; f. e exceeds a > c ' i cur- off value, wherein the second c value Is greater 
than the first cut-off value. 



e program 



haplolype Information, and chnlcai response or outcome data (cllmca! outcome values.) or other pheneiype data. Iron; a 

SNP In rhe hapietype tor the degree to which if correlates with the clinical outcome values or other phenoiype data, and 
calculate the p-value for the degree of correlation; (c) computer-readable program code for causing a computer io store for 
further prx . t 1 

(ti) computer-readable program code for causing a computer to generate all possible pair-wise combinations of the saved 
SNPs so as to provide a set of n-siie sub-iiapbtypes where n » 2; (e) computer-readable program code for causing a 
compute, tii i t „ if c > J q c ili i. fi 

clinical outran k r h n i n (ft computer- 

b!e program code for causing a computer to store for further processing those a- site sub-haplotypes whose p-value 
for Ihe degree of correlation doe:; , exceed t firs; cut-off value; t computer-readable program code for causmg a 
computer > generate all f ' > c pan-wise combinations among an;; between the n SNPs and caved sub- 

ncreased values of n; (h) computer -readable program code for causing a 
computer to repeat steps (e) througi (g) until eithe i jlolypes can b ?raie irther sut 



1 64. The computer of claim 1 61 , wherein the program code further includes computer-readable program code for causing 
a computer to display those saved SNPs and sub-hapiotypes whose p-value for the degree of c • elation v I ihe clinical 
l r h i ^ . \ . . . ii ii , nil if i <. i I than 

the first cut-off value. 



165, Tne compuk v 

oouo f urns ^ ~ ert from 

smaller sub-hapiotypes, where the smaller sub-haplotypes each have correlation values that are at least as significant as 
that of the complex sub-hapbtype. 

166. A computer programme^ c c h arrela vith j response or 
outcorroo M . ^ - . ^ . - \ < , >i < < y , i !< t < y t m c 

f i n v m - ^ \ , - t r i j) >!<• a r y> / t m 

rc n oc nc les (at computer-readable program code tor causing a computer io access a database containing 
haf j orrr 3i- one o nore genes anc : ome data, or c tie phenotype data 

i i i > it 3lly anaiy £ oac h 

gene hapiofype for :he degree to which If. correlates h ihe clinical response, outcome, or phenotype of Intorosr, m to 
e deg ee of correlation; (c) computer-readable program code for causing a computer 
n »:i, ■ , n ■ > * . ^ ^ ^ > f V 3 , c y . . y < v. x > a n t U 

) v , - roil im code for causing a computer 1o generate, for each hapiotype composed oi m 

ng a single site masked, so as to provide a set of m-n site sub- 
t- es whe'e n = 1 let computet loadable p (-gram code -or causing a computei j i ; nalyze each newly 

generated sub-haploiype for the degree to which h correlates with the ellmoal fesponse, outcome, or pheriob/pe of mierest, 
and calculating a numerical measure of the degree of correlation: (h computer-readable c ' i - \i <- , f 3 
computer to save for further processing those sub-hapiotypes whose numerical measure of the degree of correlation 
exceeds the first c >. ^ f i < a e from the saved 

sub-haploiypes, all possible sub- haplolypes having one additional sits: masked; (h) <. 1 i 1 3 program code for 

- £ x > r op - hiou^ ig) , nt n h, - -r -ur :c -t^ps <- - n t d, hi 
which exceeds the first cut-off value, or (ii) no further sub-hapiotypes having more unmasked sites than a pre-selected 
limit can be generated. 



167, The computer of claim 166, wherein ihe program code further includes computer-readable program code for causing 



a computer to display those saved sub-hapiotypes whose numerical measure of the degree of correlation with the clinical 
response data, outcome value, or other phenotype data exceeds a second cut- off value, wherein the second cut-off value 
; great nan the lost outx vafu< 



ilc ifa 



jeUsihon i - 3 c3 c >nf uterk e-Ui i )k 

{ - CHO£ I \ t i r I mr k 

3 for the degree of correlation; (c) computer-readable program code for causing a computer to store fo 
lose haplotypes whose p-vaiue for the degree of correlation does not exceed a first cut-off value; (d) 

i c. r ! i f c 3ch )b >p ) c of m c lymorphic 

b 3pht)pt < i n ---- 

•Jable program code for causing a computer to statistically analyze each newly generated sub- 



for fu her pm^ f re rijf off 

value; ] computer-readable program code for causing a computer to generate, 1 the saved h [ k , i til 
possible i tit M)Ouu 3,!i i] x Jjcu^s" :omputer-readabie program code for causing a computer 
to repeat steps (o f r/ o i - iher (n no new sub- haplotypes have a p-vaiue which does not the first cut-oif value, 
or (ii) no further sub-hapiotypes I laving more unmasked sites than a pre-selected limit can be generated. 



169. The computer of claim 1tv , - •• h nogram code further includes computer-readable prog ; i code o: causing 
a computer to display those saved sub-hapiotypes whose p-value for the degree of correlation with the clinical response, 
outcome, or phenotype of interest does not exceed a second cut-off vaiue, wherein the second cut-off value Is less than 
the first cut-off value. 



170 l"h« < 3 rr uk i i i ! > if. t i i r 

ode I , u - ■• - ^ \ i iiv itlriifm 

smaller sub-hapiotypes, where the smaller sub-hapiotypes each have correlation values that are at least as significant as 



171. A cola structure for storing sod organizing bioiogioai information, stored on a computer-readable medium and 
accessible by a processor, which composes a single parent table which is adapted for storing, organizing, and retrieving a 
plurality of genetic features by the relative positional relationships between the genetic lectures. 



172. The data structure of claim 171, whe . N i ^ -ok -ou . of three submodels comprising the data 
structure whereir ;c i submodel >r I, a variation reposit < and a literature 
repository submodel. 

173. The data structure of claim 1 

chromosomes, genomic regions, genes, gene regions, gene transcripts, transcript regions, and polymorphisms. 



174. The si cialn i a cimi >ry submodel 



1 '5 ]"h{ da a structu so a m 1 - i 1 ipri 1 



17 r ix ix 5 i c I x ; rl nf 3 v ; o- j- hu ' co r >i i c t j ; comprising 

a single parent i which is adapted for storing, organizing, and retrieving a plurality of genetic features by the relative 
positional relationships between the genetic features; and (b) positioning a first genetic feature onto a second genetic 
feature. 



177. The method of claim 175. wherein said first genetic feature is an assembly and said second genetic feature is a gene. 



178. The method of claim 1 77, further comprising positioning a third genetic feature onto said gene. 

1 79. T^c mot In ' i r i 1 1 t r comprises 
positioning onto said gene region a polymorphism. 

180. The method of claim 1 79, furl her comprising providing a relationship between the polymorphism and at least one 
I -■ noUf c » r oak - h n 

181. Tne mcthoi - i t -i - - - \, \ ng onto said ge ; a hapiotype which comprises a plurality of 
polymorphisms. 

182. The method of claim 1 78, further comprising providing a relationship between the hapiotype and at least one 
phenotype which is associated with the hapiotype. 

183. A data structure for storing and organizing biological information, stored on a computer-readable medium and 
accessible by a processor, which comprises at least two different fields, one of which includes a plurality of genetic 
eatui c rid e other o ' eiweeii the ge 
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